Extracting Unstructured Information from the WWW to Support Merchant Existence in eCommerce

نویسندگان

  • Farid Meziane
  • Mohd Khairudin Kasiran
چکیده

In [KM02]a model has been developed to support trust in eCommerce. The model is composed of four main modules where each module is a set of factors the consumer is looking for to trust a virtual merchant. These four modules represent the merchant existence, affiliation, policy and performance. In this paper, we present a model to implement the existence module by developing an information extraction system which aims at localising the required information on the merchant’s website. The system is based on rules that reflect the different ways the information is represented on the websites, their structures and the layout of the websites. The extracted information is then stored in a database to be used for a future evaluation of the trust associated with the merchant website.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Information Framework for a Merchant Trust Agent in Electronic Commerce

eCommerce is a faceless business arrangement where the process of creating trust towards merchants, hereby referred to as “merchant trust”, is still a big challenge. Merchant trust framework can be created by using several factors such as existence (people, physical, and registration), affiliation (third party endorsement, membership, and portal), performance (delivery, payment and community co...

متن کامل

Standardization of Unstructured Textual Data into Semantic Web Format

Analysis done on the nature of the data posted on the World Wide Web (WWW) reveal that more than 80% of the data over the WWW is in unstructured text format. Hence extracting information from text is of paramount importance both for academic and business purposes. Simultaneously, evolution of web technology led to the novel concept of Semantic Web, which is an extension of the current web in wh...

متن کامل

A Framework For Extracting Information From Web Using VTD-XML‘s XPath

The exponential growth of WWW (World Wide Web) is the cause for vast pool of information as well as several challenges posed by it, such as extracting potentially useful and unknown information from WWW. Many websites are built with HTML, because of its unstructured layout, it is difficult to obtain effective and precise data from web using HTML. The advent of XML (Extensible Markup Language) p...

متن کامل

Challenging Issues and Similarity Measures for Web Document Clustering

Web itself contains a large amount of documents available in electronic form. The available documents are in various forms and the information in them is not in organized form. The lack of organization of materials in the WWW motivates people to automatically manage the huge amount of information. Textmining refers generally to the process of extracting interesting and non-trivial information a...

متن کامل

iDocument: Using Ontologies for Extracting and Annotating Information from Unstructured Text

Due to the huge amount of text data in the WWW, annotating unstructured text with semantic markup is a crucial topic in Semantic Web research. This work formally analyzes the incorporation of domain ontologies into information extraction tasks in iDocument. Ontologybased information extraction exploits domain ontologies with formalized and structured domain knowledge for extracting domain-relev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003